PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation

Authors

Abstract

The lightweight MLP-based decoder has become increasingly promising for semantic segmentation. However, the channel-wise MLP cannot expand the receptive fields and so lacks the context modeling capacity that is critical to semantic segmentation. In this paper, we propose a parameter-free patch rotate operation to reorganize the pixels spatially. It first divides the feature map into multiple groups and then rotates the patches within each group. Based on the proposed patch rotate operation, we design a novel segmentation network, named PRSeg, which includes an off-the-shelf backbone and a lightweight Patch Rotate MLP decoder containing multiple Dynamic Patch Rotate Blocks (DPR-Blocks). In each DPR-Block, the fully connected layer is performed following a Patch Rotate Module (PRM) to exchange spatial information between pixels. Specifically, in PRM, the feature map is first split into a reserved part and a rotated part along the channel dimension according to the predicted probability of the Dynamic Channel Selection Module (DCSM), and our proposed patch rotate operation is performed only on the rotated part. Extensive experiments on the ADE20K, Cityscapes and COCO-Stuff 10K datasets prove the effectiveness of our approach. We expect that PRSeg can promote the development of MLP-based decoders in semantic segmentation.
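The abstract describes the patch rotate operation and the channel split only in words, so the following is a minimal NumPy sketch under stated assumptions rather than the authors' implementation. It assumes that groups are square blocks of `group` x `group` patches, that "rotating" means turning the grid of patches inside each group by 90 degrees while leaving the pixels inside each patch untouched, and that a channel belongs to the rotated part when its probability exceeds a fixed threshold. The names `patch_rotate`, `prm_sketch`, `group`, `patch`, and `threshold` are hypothetical, and the per-channel probabilities are passed in directly instead of being predicted by a DCSM.

```python
import numpy as np


def patch_rotate(x, group=2, patch=2, k=1):
    """Parameter-free rearrangement of patches inside each group (illustrative).

    x     : array of shape (C, H, W)
    group : patches per group side (assumption: square groups)
    patch : pixels per patch side
    k     : number of 90-degree turns applied to each group's patch grid
    """
    c, h, w = x.shape
    span = group * patch  # side length of one group in pixels
    if h % span or w % span:
        raise ValueError("H and W must be multiples of group * patch")
    # Axes: (C, group row, patch row, pixel row, group col, patch col, pixel col)
    t = x.reshape(c, h // span, group, patch, w // span, group, patch)
    # Turn the grid of patches inside every group; pixel order inside a patch is kept.
    t = np.rot90(t, k=k, axes=(2, 5))
    return t.reshape(c, h, w)


def prm_sketch(x, probs, group=2, patch=2, threshold=0.5):
    """Split channels into reserved and rotated parts, rotate only the latter.

    probs stands in for the per-channel output of a selection module;
    the hard threshold is an assumption made for this sketch.
    """
    rotated = np.asarray(probs) > threshold
    out = x.copy()
    if rotated.any():
        out[rotated] = patch_rotate(x[rotated], group, patch)
    return out
```

Because the operation only permutes pixel positions, it adds no learnable parameters, and applying `patch_rotate` four times with `k=1` returns the original feature map. A fully connected layer applied over channels after such a step would then see, at each position, features that originally came from a different patch of the same group.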

Similar resources

Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation

Spatial pyramid pooling modules and encoder-decoder structures are used in deep neural networks for the semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view, while the latter networks can capture sharper object boundaries by gradual...

SegNet: A Deep Convolutional Encoder-Decoder Architecture for Scene Segmentation

We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG...

Geodesic Patch-Based Segmentation

Label propagation has been shown to be effective in many automatic segmentation applications. However, its reliance on accurate image alignment means that segmentation results can be affected by any registration errors which occur. Patch-based methods relax this dependence by avoiding explicit one-to-one correspondence assumptions between images but are still limited by the search window size. ...

RDFa: Lightweight Semantic Enrichment for Hypertext Content

RDFa is a syntactic format that allows RDF triples to be integrated into the hypertext content of HTML/XHTML documents. Although a growing number of methods and tools have been designed to generate or digest RDFa, comparatively little work has been carried out on finding a generic solution for publishing existing RDF data sets with the RDFa serialisation format. This paper proposes a...

Lightweight Semantic Prototyper for Conceptual Modeling

While much research work was devoted to conceptual model quality validation techniques, most of the existing tools in this domain focus on syntactic quality. Tool support for checking semantic quality (correspondence between the conceptual model and requirements of a domain to be engineered) is largely lacking. This work introduces a lightweight model-driven semantic prototyper to test/validate...

Journal

Journal title: IEEE Transactions on Circuits and Systems for Video Technology

Year: 2023

ISSN: 1051-8215, 1558-2205

DOI: https://doi.org/10.1109/tcsvt.2023.3271523